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AESTRACT 

The efficacy of use of the Raven Progressive Matrices 
Test (RPM) in the selection of gifted children from traditionally 
underrepresented groups was investigated in a large-scale study with 
a diverse population. A total of 16,985 subjects were given the Raven 
Progressive Matrices Test. These included 22.7 percent Latinos, 37 
percent Whites, 14 percent African-Americans, 2.8 percent Asians, 8.4 
percent Filipinos, and 5.6 percent Indochinese, each of whom had been 
identified as potentially gifted on the basis of a case study 
analysis by a school psychologist. The sample of children certified 
as gifted based on Raven performance was compared to a group 
certified as gifted based on individual administrations of the 
Wechsler Intelligence Scale for Children - Revised (WISC-R) , and to 
actual enrollment ratios of cultural subgroups for the school 
district. Findings support the position that the RPM is a more 
equitable test than the WISC-R for evaluating students from 
ethnically diverse populations, as well as for ensuring greater 
gender equity. (PB) 
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Abstract 



The efficacy of use of the Standard Raven Progressive Matrices 
Test (RPM) in the selection of gifted children from traditionally 
underrepresented groups was investigated in a large-scale study with a 
diverse population. A total of 16,985 Raven subjects included 22.7% Latinos, 
37% Whites, 14% African- American, 2.8% Asian, 8.4% Filipinos, and 5.6% 
Indochinese, each of whom had been identified potentially gifted based on 
a case study analysis by a school psychologist. The resultant sample of 
children certified gifted based on Raven performance was compared with 
a group certified gifted based on individual admiriistrations of the Wechsler 
Intelligence Scale for Children - Revised (WISC-R). The Raven gifted sample 
was also compared to actual enrollment ratios for the school district to 
ascertain if equity in representation could be achieved for gifted programs 
using the Raven. Although chi square results i-. iicated that use of the RPM 
as opposed to the WISC-R led to increased equity for all groups, full equity 
was not achieved. When compared to the district population, the Raven- 
selected group showed underrepresentation for Latinos and African- 
Americans and overselection for Whites, Asians, and Filipinos. Chi square 
comparisons based on the expectation of equal gender representation 
revealed that the WISC-R disproportionately overselected boys, while the 
RPM showed no gender differences. The findings support the position 
that the RPM is a more equitable test than the WISC-R for ethnically diverse 
gifted populations and for girls. 



UseoftheRavenProgressi^ 

Professionals in the Held 
developing alternatives to traditionally ^StomSdWonally underrepresented groups such as 
alternatives are particularly needed for ^^^^ ^eatedly mentioned in the literature 
African-Americar*and^ ^ 1986) . A number of 

is the Raven Progressive Matrices (RPM) Jest ^ measure of cognitive ab u lt y wl th 
investigators have pointed to the ^^^Xonally underrepresented and culturally drverse 

The standard form of the RPM ^ 
sets of 12 designs each. Within each set of 12, £ P^^^ a ^ piece. The student's 
the five sets is progressively more d^ult Each J^*^ among six to eight alternatives. Correct 

sr e ^ as - size ' reduc or 

complexity, and number of elements. 

Because RPM stimuli are visual!, JP--*^ Xg^MW 
perception or spatial reasoning. ^^^^^^Z^^^ci^t^ 
have noted, "The Raven is as close to a ^^^AbsSw md Colleagues have shown using radex 
of specie content acquisition as is a^ of ^ al M ce (Marshalek 

and hierarchical models, the RPM is * e 1984) . ^ a me asure of general intelligence the 

Lohman, & Snow, 1983; Snow, Kyllonen, & Sough me stimuli themselves are completely 
P^Mcorrelateshighlyw^ 

nonverbal (Carpenter, Just, & Shell, W ^ the entire brain is involved in solving 

produce computer generated images ot the brain, £ave s hemisphere, the left temporal 

RPM problems, with the three most used areas being toe ngn Browning, & Buchsbaum 

'""^^-■^J" ----- ^t^reS^ 

*> d a « cn,i0n .f '"T Xt^eTsed^n acquit knowledge, specific 
OTM^blemsdcLnotdepen^^ 

Surel experiences, or reading .Mr* ^^ttrmarion, without relying extensive 1, .or tan 
the ability to reason and solve P' ob ' c ^ mvol ™l„ ^ schooling or previous experience. In sum, 
Splici. SL of declarative SSJS^fLi-S of linguislic abUiry. 1. uses 

ii^r;^ *— * — ■ 19861 - Nava, ° 

students (Sidles & MacAvoy, 1987). ^ ^ ^ ^ ^ 

As Raven(1989)has noted, the Raver ^^^^S««J 
published psychological studies (see Court, lWjCwrt £ * , ^ begn usfid dely m 



aeed 5 to 18 were chosen to be representative of school districts across the United States in approximately 
30 normine studies. Ethnicity and socioeconomic factors made independent contributions to the 
variance Ethnic differences, which were attributed to differences in birth weight, infant mortality, and 
the incidence of serious childhood illness, showed a decline compared to earlier reports (Burciaga, 
1973- Hoffman 1983- Jensen, 1980). There were, for example, no major Hispanic/White differences in 
the Ontario-Montclair School District of California. Moreover, the RPM was found to have equal 
predictive validity within each group (Hoffman 1983, 1986). 

It should be noted that while ethnic differences may be declining, there remain differences in 
the mean scores as a function of ethnicity as well as socioeconomic background. Nevertheless, the 
question remains as to whether the RPM can be used to achieve a more equitable selection of students 
man can be obtained with more widely used traditional tests, such as the Wechsler Intelligence Scales. 

In the present study we present the results of a Jacob Javits grant whose purpose was to evaluate 
the efficacy of the RPM in the selection of traditionally underrepresented groups for gifted programs. 
The data were collected over a three year period and include an archival data pool of over 5,000 
administrations of the Wechsler Intelligence Scale to children from 8 major ethnic backgrounds. 



Method 



The subjects were 16,985 children who were referred and evaluated for the gifted program at 
San Dieeo City schools between the Fall of 1991 and the Spring of 1993. Students were classified into 
one of eight ethnic backgrounds based on self-report of the parent as follows: 3,8M Utoo/ Hispanic, 
6,286 White,2,389 African-American, 483 Asian, 75 Native-American, 104 Pa ^ ckl ^ e ^^P m0 ' 
and 958 Indochinese. There were 1407 classified as "Other". Of the 16,985 subjects 51.5% (8,740) were 
female 48.5% (8,245) male. The distribution by grade level was as follows: 24 first-, 7,664 second-, 
1,467 third-, 819 fourth-, 3,737 fifth-, 748 sixth-, 2,122 seventh-, 263 eighth-, and 90 ninth-graders. There 
were 51 cases where data on grade was missing. 

As a comparison sample, the f iles of all children evaluated for gif tedness in the San Diego City 
School System between 1984 and 1990 were examined. During this time period, the district had used 
the WISC-R as the primary tool for identifying giftedness. A total of 9315 students had been given the 
WISC-R during this time period. 

A second point of comparison was based on actual enrollment figures by ethnic background 
durina the course of the study. The average enrollment ratio in the district as a whole by ethnic 

bXo-^^ 

American, 2.3% Asian, 0.6% Native-American, 0.7% Pacific Islander, 8% Filipino, and 7.7% Indochinese. 

^"'children were given either a WISC-R (entire 1984-1990 sample) or a Standard Raven* entire 
1990-1993 sample) by a district school psychologist. The WISC-Rs were individually administered, tite 
Zen was Soup ^ administered. As a part of the evaluation process the school psychologists conducted 
a cas s*d ^analysis of each child to evaluate for the presence of risk factors and level of achievement 
JseTsaccuzzo Johnson, & Russell, 1992). Five risk factors were considered. These were: cultural/ 
an^^economS emotional, environmental, and health. Achievement was evaluated in terms of 
California Test of Basic Skills (CTBS) or the Abbreviated Stanford Achievement 

Test (ASAT). 

Children were certified as gifted if they obtained a Full Scale IQ of 130 on the WISC-R or an IQ 
equivalen^^^^ 

U S Norms reported by Raven et al. (1986). In addition, children who had two or more of the nsk 
factors w^ certified as gifted if they had a Full Scale WISC-R IQ of 120 or Raven IQ equivalent of 120. 
SXSSwEo obfained a WISC-R or Raven score of 3 standard deviations above the mean were 
placed in a special "Seminar" program. 



Results 



Fieure 1 illustrates the ethnic composition of the district for all eight ethnic backgrounds 
comparedto all children who were certified as gifted with the W1SC-R (entire 1984-1990 sample and to 
who were certified as gifted with the Raven (entire 1990-1993 sample). Inspector icf Figure 
1 reveals that the RPM led to increased equity for all ethnic groups. For example, only about 20 h of the 
expected number of Latino/Hispanic were selected with the WISC-R, while about 80% of the expected 
were selected using the RPM. For the Whites, about 200% of the expected were selected with theWBC- 
R whUeonly about 120% of the expected were selected with the RPM. Pacific Islanders, Native- 
Wra ind Indochinese were all I greatly underselected by the WISC-R, but selected according to 
mT ^aton based on their numbe . with the Raven. Filipinos went from about 60% o expectato" 
wHh the WISC-R to about 120% with the Raven. African-Americans went from about 40% ofthe 
TxpecTaton to over 60%. Thus, the RPM provided a more equitable distribution for all ethnic 
backgrounds. 



Figure 3. Ethnic composition of those certified gifted on the basis of WISC-R scores (1984-1990) 
versus Raven's Progressive Matrices scores (1990-1993). 
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equity TaU groups but Native-Americans (where the n was small) and Asians. 



Table 1. 



Chi Square results using children certified with the WISC-R as the expected 
and children certified with tlie Raven as the observed 



Ethnicity 


df 


n (Expected) 


n (Observed) 


Chi Square 


Latino/Hispanic 




413 


1304 


1922.23 *** 


White 




5535 


3441 


792.20*** 


African-American 




454 


714 


148.90*** 


Asian 




260 


306 


8.14 


Native American 




20 


36 


12.80 


Pacific Islander 




10 


50 


160.00*** 


Filipino 




321 


713 


478.70*** 


Indochinese 




112 


513 


1435.72*** 



* p < .05 
*** p < .001 



Despite the substantial improvement in equity when the RPM was used, inspection of Figure 1 
indicates that full equity was not achieved. To verify this observation statistically Chi Squares were 
again computed using the proportionate number of children in each ethnic background as the expected 
compared to the number actually certified with the Raven as the observed. 

Table 2 presents a summary of these findings and reveals significant discrepancies for all groups 
except Native-Americans, Pacific Islanders, and Indochinese. Latino/Hispanics and African-Americans 
were significantly underselected compared to the expectation while Whites, Asians, and Filipinos were 
significantly overselected. 



Ti'ble 2. Chi Square results comparing the actual proportion of children in each ethnic 
background with the number actually certified with the Raven 



Ethnicity 

La tino / Hispanic 
White 

African-American 
Asian 

Native American 
Pacific Islander 
Filipino 
Indochinese 



df 



n (Expected) 


n (Observed) 


Chi Square 


2009 


1351 


215.51*** 


2743 


3568 


244.69*** 


1200 


766 


156.96*** 


170 


319 


130.59*** 


41 


39 


0.10 


55 


48 


0.89 


595 


739 


34.85*** 


569 


532 


2.41 



p<.05 
p<.01 
p<.001 
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To examine possible ethnic differences on the Raven independently of risk factors, we used the 
following system. First, we considered only those diildren who scored in the 98th percentile or better 
on the Raven. Next, we determined the number of children in the school district in ea A ethmcbadcground 
that would represent 2 percent of that group. For example, given the total number of Utoo/Hisparucs 
in the district, 905 would be in the top two percent. The actual number of children m the top two 
percent for each ethnic background was used as the expected, while the number of children in each 
background who actually scored in the upper 2% (i.e., 98th percen^e or better) was u^ as the observed 
in a Series of Chi Square analyses. Table 3 summarizes these analyses and shows significant discrepancies 
for four groups: Latino /Hispanics and African-Americans, who were underselected; and Whites and 
Asians who were overselected compared to the expectation. 



Table 3. Chi Square results comparing the actual number of children in tte top 2% of each ethnic 
background with the number who actually scored in the upper 2% 



Ethnicity 

Latino /Hispanics 
Whites 

African-Americans 
Asians 

Native Americans 
Pacific Islanders 
Filipinos 
Indochinese 



df 



n (Expected) 


n (Observed) 


Chi Square 


905 


379 


305.72*" 


1238 


2009 


480.16*" 


541 


209 


203.74*** 


77 


173 


119.69*" 


18 


21 


0.50 


25 


12 


6.76 


268 


316 


8.59 


256 


199 


12.69 



* p < .05 
** p < .01 
*** p < .001 



To investigate the possibility of gender differences in WISC-R and Raven performance, the 
number of boys aSd girls who actually scored in the 98th percentile and above on each test in each 

canbeLn in Table 4, there were no significant differences in performance for boys and gxrls of any 
ethnic background on the Raven. 



Table 4. 



Chi Square results comparing the number of boys and girls of each ethnic 
background who actually scored in the top two percent on the Raven 



Ethnicity 

Latino/Hispanics 
Whites 

African-Americans 
Asians 

Native Americans 
Pacific Islanders 
Filipinos 
Indochinese 



df 



p<.05 
p<.01 
p<.001 



Total n 


Boys (Observed) 


Chi Square 


374 


181 


0.19 


2002 


1057 


3.13 


206 


116 


1.64 


172 


79 


0.57 


21 


12 


0.21 


12 


6 


0 


315 


170 


0.99 


198 


100 


0.01 
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For the WISC-R, by contrast, significantly more White boys (2632) than girls (2272) achieved 
an IQ score in the top two percent (Table 5). Moreover, the trend was in favor of boys on the WISC-R 
for every ethnic group except Pacific Islanders. 



Table 5. Chi Square results comparing the number of boys and girls of each ethnic 
background who actually scored in the top two percent on the WISC-R 



Ethnicity 




n 


Boys (Observed) 


Chi Square 


Latino /Hispanic 




265 


152 


2.87 


White 




4904 


2630 


13.21*** 


African-American 




252 


129 


0.07 


Asian 




202 


109 


0.63 


Native American 




13 


9 


0.96 


Pacific Islander 




9 


3 


0.50 


Filipino 




182 


107 


2.81 


Indochinese 




61 


34 


0.40 



* p < .05 

** p<m 
*** p < .001 



In order to examine performance of individuals at the highest level of ability (i.e., three standard 
deviations above the mean), it was necessary to generate local norms, since the norms provided in the 
manual only go up to the 99th percentile. In constructing local norms for the 99.1 through 99.9 percentile, 
we used the following procedure. Based on the table of smoothed North American norms provided by 
Raven, et al. (1986), we selected all those children in our study who had obtained a raw score in the 99th 
percentile for each age range in the table. We then examined the frequency distribution of raw scores 
for each age range and attempted to break the scores down into ten groups occurring with equal 
frequency, representing the 99.0 through 99.9 percentile. 

In the analyses that followed, the local norms (Guertin, Johnson, Saccuzzo, & Lopez, 1992) 
were used to identify students who scored three standard deviations above the mean (i.e., above the 
99.87 percentile). Figure 2 illustrates the proportion of students from the entire 1990-1993 sample, by 
ethnic background, who obtained scores at the 99.9 percentile on the local Raven norms versus the 
proportion, by ethnic background, who obtained WISC-R Full Scale IQ's of 145 or above. As inspection 
of Figure 2 shows, there was an increase in the representation of children selected for the schools' very 
gifted "Seminar" progTam for all ethnic backgrounds except for the Whites, who represent about 35 
percent of the district. With the WISC-R, Whites represented more than 80% of the children selected for 
the "Seminar" program. With the Raven, Whites represented less than 60%. 
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Figure 2. Ethnic composition of the 99.9th percentile: WISC-R versus 
Raven's Standard Progressive Matrices 



100 r 




To demonstrate statistically the superiority of the RPM over Une WISC-R for selection at the 
highest level of ability, Chi Square analyses were computed for each ethnic background using .the 
propionate number^ children selected by the WISC-R as the expected and the number selected by 
C Raven as the observed as was done in Table 1. Results were similar to those found for the children 

ethnic backgrounds except Asians and Native Americans (See Table 6). 



Table 6. Chi Square results using children certified "Seminar" with the WISC-R as the 
expected and children certified Seminar" with the Raven as the observed 



Ethnicity 

Latino/Hispanics 
Whites 

African- Americans 
Asians 

Native Americans 
Pacific Islanders 
Filipinos 

Indochinese 

» p < .05 
»» p < .01 
»»» p<.001 



df 



n (Expected) n (observed) Chi Square 



24 

430 

22 

26 

1.4 

1 

13 
4 



69 

300 

33 

33 

3 

4 

49 
29 



84.38*** 

38.79*** 

5.50* 

1.88 

1.83 

9.00** 

99.69*** 

156.25*** 
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As with the children who scored two standard deviations above the mean, there were inequities 
for children who scored three standard deviations above the mean. A total of 520 children in our 
sample scored in the 99.9th percentile on the Raven, as determined by local norms. We compared the 
number who actually scored at that level with the number expected based on district proportion for 
each ethnic group. Results, summarized in Table 7, indicate that Latino /Hispanics and African- 
Americans were underrepresented while Whites, Asians, and Indochinese were overrepresented. 



Table 7. Chi Square results comparing children certified "Seminar" using the Raven 
to their proportionate numbers in the district 



Ethnicity df n (Expected) n (Observed) Chi Square 



Latino/Hispanics 1 141 69 36.77*' 

Whites 1 193 300 59.32*' 

African-Americans 1 84 33 30.v6* 

Asians 1 12 33 36.75* 

Native Americans 1 ? 3 0.00 

Pacific Islanders 1 4 4 0.00 

Filipinos 1 42 49 1.17 

Indochinese 1 41 29 3.51 



* p < .05 
** p<.01 
*** p < .001 

Figure 3 illustrates the gender composition for children who scored either at or above the 
99.9 percentile on the Raven or had a Full Scale WISC-R IQ of 145 or greater. 



Figure 3. Gender composition of the 99.9th percentile: WISC-R versus Raven's Standard 
Progressive Matrices 
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As Figure 3 shows, the substantial gender imbalance in favor of males with the WISC-R was 
essentially eliminated with the RPM. Chi square analyses statistically demonstrated the reduction m 
gender bias at the highest levels of ability. Results are summarized in Tables 8 and 9 where actual 
numbers of boys and gM» who scored three standard deviations above the mean on J fce WISC-R and 
the Raven, respectively, were compared with the expectation of equal numbers of boys and girls. 



Table 8. Chi Square results comparing boys and girls certified "Seminar" using the 
WISC-R with the expectation of equal gender numbers 



Ethnicity 



Indochinese 



df Total n Boys (Observed) Chi Square 



29 0.35 



Latino/Hispanic 1 52 

White 1 925 570 22.47* 

African- American 1 47 26 • 

Asian 1 56 30 

» • 1 \ 2 0.17 

Native American \ i 

Pacific Islander 12 2 ■ 



* p < .05 
** p < .01 
*** p < .001 



Table 9. CM Square results comparing boys and girls certified "Seminar" using the 
Raven xoith the expectation of equal gender numbers 



Ethnicity 

Latino /Hispanic 
White 

African- American 
Asian 

Native American 
Pacific slander 
Filipino 
Indochinese 



df 



n (total) 


Boys (Observed) 


Chi Square 


69 


34 


0.007 


300 


156 


0.24 


33 


18 


0.14 


33 


15 


0.14 


3 


1 


1.16 


4 


3 


0.5 


32 


32 


2.30 


29 


15 


0.02 



p < .05 
p<.01 
p < .001 



r * 



1.2 
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The efficacy of the Raven was compared to that of the WISC-R for predicting achievement in a 
series of correlations between intelligence tests and achievement tests used by the school district. Each 
child received a test of intelligence and an achievement test during the same school year. Table 10 
demonstrates correlations between performance on the Raven, as expressed in z-scores, and on the 
subtests of the Comprehensive Test of Basic Skills (CTBS) for 1707 children, as well as correlations 
between CTBS and WISC-R for another 1925. As can be seen, the Raven correlated more highly with 
CTBS Total Language subscores than did the WISC-R for African-American and White children, but 
not for Latinos. In fact, for African-Americans no significant correlations were found between WISC- 
R scores and CTBS Total Language. Raven performance correlated more highly with CTBS Total Math 
scores than did WISC-R only for White children. For all three ethnic backgrounds, WISC-R VIQ scores 
were more highly correlated with CTBS Total Reading scores than were Raven scores. 



Table 10. Correlation(r) of Raven or WISC-R performance with achievement test 
(CTBS) performance for children of three ethnic backgrounds 



Subtest 



Whites 

RAVEN (n = 901) 
VIQ (n = 1566) 
PIQ (n = 1566) 
FSIQ (« = 1566) 

African-Americans 
RAVEN (« = 276) 
VIQ (n = 221) 
PIQ (n = 221) 
FSIQ (n = 221) 

Latinos/ Hispanics 
RAVEN (n = 530) 
VIQ (n = 138) 
PIQ (n = 138) 
FSIQ (« = 138) 



CTBSTL 


CTBSTR 


CTBSTM 


.1907** 


.1727** 


.2187* 


.1638** 


.2479** 


.1836** 


.1081** 


.1274** 


.1648** 


.1623** 


.2292** 


.2069** 


.2372** 


.1916** 


.1257* 


.0691 


.2740** 


.1440 


.0184 


.1126 


.1771** 


.0541 


.2406** 


.2002** 


.1563** 


.0797* 


.1327** 


.1899* 


.1365 


.3150** 


-.0476 


- .0196 


.1708* 


.0761 


.0725 


.2718** 



*p < .05 
**p< .01 
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Correlations between the Raven and the Abbreviated Stanford Achievement Test (ASAT) for 
children of six ethnic backgrounds are summarized in Table 11. Correlation <?^«^^tem 
2235 to .3109, with a median r = .25. Clearly, the Raven correlates more highly with the ASAT than i with 
the CTBS in all three domains measured (language, reading, and math). In addition, Raven-ASAl 
correlations were higher than WISC-R— CTBS correlations. 

Table 11 Correlation (r) of Raven performance with achievement as measured by the 

Abbreviated Stanford Achievement Test for children of six ethnic backgrounds 
(Raven performance was expressed in Z scores.) 







Total 


Total 


Total 


Ethnicity 


n 


Language 


Reading 


Math 


African-American 

Asian 

White 

Filipino 

Indochinese 

Latino 


1581 

305 

4020 

1002 

587 

2528 


.2478** 
.2386** 
.2245** 
.2235** 
.2582** 
.2468** 


.2639** 
.2654** 
.2252** 
.2438** 
.2484** 
.2981** 


.2695** 
.2634** 
.2494** 
.2429** 
.2584** 
.3109** 



**p< .01 

Discussion 

Our results lead to two clear conclusions. First, considering only children referred for giftedness 
testing the RPM produces far better equity for all ethnic backgrounds when compared to the WISC-R 
mSl tS RPM overselected Whites, it did so at a substantially reduced rate (i.e., 120 percent over 
^ecta^onwim the Raven vs. 200 percent over expectation with the WISC-R). Moreover, whde it did 
Xroduce complete equity, even when considering such risk factors as low socioeconomic or cultural 
SeCces the RPM led to substantially increased selection ratios for traditionary unrepresented 
^up Si* ^Latino/Hispanics and African-Americans. Moreover the ^j^^ 
Election for Native-Americans, Pacific Islanders, and Indochinese, all of whom had been 
underrepresented with the WISC-R. 

The success of the RPM with Indocliinese is of interest in terms of evaluating non-English 
speakme^ldrln In Repast the district had difficulties evaluating giftedness m Indochinese £uUtan 
whoSke little or no English. Since use of the RPM enabled evaluation of abihty ^P«^^ 
touTse it was possible to assess these children with the least bias hereto ore achieved With the RPM 
iTdoSse wen? selected almost exactly in proportion to their numbers in the district as a whole. 

The advantage of the RPM was not only evident in terms of producing a ^J^g*^ 
distribution but it was more effective than the WISC-R in predicting language achievement for African 
A^eric^d White children, as well as in predicting math achievement for Whites. Altiiough g Our 
™-used~ 

?T SC 7L?W™K co^te^ Wghlr*an the RPM with CTBS Total Reading for all groups 
H T TvtS for ^c^SSSand Latinos. It would be instructive to directly compare 
j^5£^£ZZ A^AT correlations in the same sample to more completely compare 
efficacy for predicting achievement. 

-pup Raven ' s a biW to predict achievement, even language achievement, is due to its high 
, ?uZ I T fl904 1927a 1927b) s factor (see Carpenter et al., 1990; Marshalek et al 

correlation with Speanr .u W 1* Ta^b ^ ^ ^ £ correlated with g . saturated 
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culture (Court, 1991). Moreover, since it does not depend on an explicit knowledge base, as does the 
WISC-R and other verbally weighted standardized tests, the RPM is better suited to traditionally 
underrepresented children. It must be emphasized, however, that the RPM is not simply for 
underrepresented children, as it led to a more equitable distribution across ethnicity, even for Whites. 

Previous studies mat have compared the WISC-R and the Raven in the selection of gifted children 
(eg., James, 1984; Pearce, 1983; Tulkin & Newbrough, 1968; Meeker, 1973; Kier, 1949) have been primarily 
concerned with the correlation between the two measures. Such studies have reported correlations in 
the .70's and have been generally supportive of the Raven as an alternative. Our findings suggest that 
where equity is a concern, the Raven is a far better measure. 

It should be noted, however, that the population of gifted children selected by the Raven is not 
identical to that selected by the WISC-R. The differences are important. First, there is the group of 
children who would be selected by either test. These children tend to be verbally advanced, highly 
intelligent, and high in achievement. Second, there is the group of children who would be qualified by 
a WISC-R, but not by a Raven. In our experience, such children tend to be verbally advanced and 
highly motivated. Teachers of academic gifted programs readily accept these children as gifted, especially 
in the early years. Due to the unreliability of IQ scores at the upper IQ levels, especially for the younger 
age levels, the IQ's of many of these verbally advanced children show regression to the mean as they 
mature. 

A third group of children are those who would not qualify with the WISC-R, but would with 
the Raven. Such children are of extremely high potential, but may be only average (or even below 
average) in achievement. Others may be of very high ability, but poorly motivated. Our experience has 
revealed that this type of student is not always accepted by teachers as gifted. Yet, it is this very type of 
child — the one with raw, undeveloped potential — for whom the present investigation was aimed. For 
programs using the Raven on a widescale basis, teacher training is often needed to help integrate these 
students into the gifted classroom. 

One of the reasons tests like the WISC-R continue to be used, in spite of their obvious selection 
bias, is that they are reliable and objective. To date, no other approach matches standardized tests m 
terms of reliability, predictive validity, and objectivity. Yet, given the huge selection bias inherent in the 
WISC-R, it is difficult to imagine how it, and others like it, can continue to be used in today's litigious 
environment and heightened awareness of civil rights. A simple examination of Figure 1 would find 
the WISC-R completely unacceptable as a tool to select for giftedness. 

The Raven clearly fared better in terms of equity without sacrificing objectivity, reliability, and 
predictive validity. Nevertheless, it is also clear that the Raven, even when used in conjunction with a 
consideration of risk factors, still falls short of producing equity across all ethnic groups. 

Based on our present findings, and on previous reviews of psychometric tests (Kaplan & 
Saccuzzo, 1993), we are convinced that as traditional tests are presently used, there exists not a single 
one that would produce an equitable selection for gifted programs. If the goal is to select children for an 
academic program using an objective, reliable measure with high predictive validity, then traditional 
tests as they are presently used, fall short in terms of equity. The search for multiple intelligences, as 
suggested by Gardner (1983), is, of course, one viable option. The use of portfolios and other subjective 
approaches, however, while promising on a small scale, faces numerous obstacles in terms of objectivity, 
reliability, and predictive validity. 

The question remains, are traditional tests beyond redemption? We believe that there remain 
potentially promising options. One such option, suggested by Raven (1989), is the use of local ethnic 
norms for selection purposes in gifted programs. A second, suggested by Carlson (1989) and Colleagues 
(Carlson & Dillon, 1978; Carlson & Wiedl, 1978), is to use traditional tests in creative ways. For example, 
usine a dynamic testing approach with the Raven, Carlson and Wiedl (1979) were able to ehminate 
Hispanic/ White and Black/ White differences in IQ. Before we abandon what remains the most objective, 
reliable and valid approach to selection, it behooves us to determine whether innovative uses can rescue 
traditional assessment procedures, or if they should be abandoned in favor of less psychometncaUy 
sound, but perhaps more equitable, approaches. 
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